Re-Encoding Techniques for Interpolation-Based 
Decoding of Reed-Solomon Codes 



Christian Senger 

Institute of Communications Engineering, Ulm University, Ulm, Germany 
Email: christian.senger@uni-ulm.de






Abstract — We consider interpolation-based decoding of Reed- 
Solomon codes using the Guruswami-Sudan algorithm (GSA) 
and investigate the effects of two modification techniques for re- 
ceived vectors, i.e., the re-encoding map and the newly introduced 
periodicity projection. After an analysis of the latter, we track 
the benefits (that is low Hamming weight and regular structure) 
of modified received vectors through the interpolation step of the 
GSA and show how the involved homogeneous linear system of 
equations can be compressed. We show that this compression as 
well as the recovery of the interpolated bivariate polynomial is 
particularly simple when the periodicity projection was applied. 



I. Introduction 

The discovery of the Guruswami-Sudan list decoding algorithm (GSA) [1] was one of the major breakthroughs in algebraic coding theory. Building upon the original Sudan list decoding algorithm [2], which is restricted to the practically less interesting case of low code rates, it manages to decode Reed-Solomon codes (and, more generally, Algebraic Geometry codes) of arbitrary rate beyond half their minimum distance with polynomial time complexity in the code length. It was pointed out in [1] that the two algorithms are closely related to each other and that the older algorithm is a special case of the newer one. It can be attributed to the work of Gemmell and Sudan [3] that the classical bounded minimum distance decoder of Welch and Berlekamp [4] is a special case of both. The latter fact is nicely elaborated in [5, Sections 5.2 and 12.2].

Guruswami and Sudan did not aim to reduce the degree of the complexity polynomial and, as we will see in Section IV, their original algorithm is in $O[s^3 \ell^3 n^3]$, where $n$ is the code length and $s$, $\ell$ are two interdependent parameters
to be explained later. Since 1999, several authors proposed 
improved versions of the algorithm, whose time complexities 
are quadratic in the code length instead of cubic. Most 
improvements are based on the fact that the computationally 
most expensive task of the algorithm is bivariate polynomial 
interpolation, which can be realized as a linear system of 
equations. The complexity of solving general linear systems 
using Gaussian elimination is cubic in their size, but it has 
been observed that the linear system which appears in the 
GSA is a rather structured one. 

Among the fastest realizations of the interpolation is the solution due to Augot and Zeh [6], see also [7]. It is based

This work has been supported by DFG, Germany, under grant BO 867/22-1. 



on an adaptation of Feng and Tzeng's Fundamental Iterative Algorithm (FIA) [8] for block Hankel matrices, which is in turn due to Roth and Ruckenstein [9]. Its time complexity is in $O[\ell s^4 n^2]$. Other solutions have been presented by Alekhnovich [10] (Diophantine equations), Olshevsky and Shokrollahi [11] (matrix displacement/Schur complement), and Trifonov [12] (Gröbner bases). More references can be found in [13, Chapter 9].

Another technique to reduce the complexity of the GSA is to reduce the size of the enclosed interpolation problem. This re-encoding approach was followed by Gross et al. [14], Ma [15], and Kötter et al. [16]. In their papers, bivariate interpolation is done using polynomial-based algorithms like the Kötter algorithm [17].

Our contribution is the description and analysis of a particularly simple case of re-encoding, the periodicity projection, which allows compressing the interpolation problem using especially sparse data structures while maintaining its regular structure. We conjecture that this is useful for the complete GSA, but in this paper we focus only on the involved interpolation. Along the way, we investigate the effect of general re-encoding on the linear system and make it applicable for matrix-based interpolation as in [6].

The rest of the paper is organized as follows. In Section II we define some basic notions and recall the re-encoding map. Section III is devoted to periodic vectors and the periodicity projection. Besides their fundamental properties, we also explain their relation to the re-encoding map. In Section IV we briefly recapitulate the GSA. In Section V we investigate the GSA when it is applied to sparse and structured received vectors as created by the re-encoding map and the periodicity projection, and show how it can be sped up. Section VI concludes the paper with some closing comments.

II. Reed-Solomon Codes and Re-Encoding 

Definition 1 Let $q$ be a prime power, $n = q - 1$, and let $\mathbb{F}_q$ be a finite field with $q$ elements. Let further $\alpha \in \mathbb{F}_q$ be a primitive element. The Discrete Fourier Transform (DFT) of $v = (v_0, \ldots, v_{n-1}) \in \mathbb{F}_q^n$ is $\mathcal{F}[v] = V = (V_0, \ldots, V_{n-1}) \in \mathbb{F}_q^n$, where

$$V_j = \sum_{i=0}^{n-1} v_i \alpha^{ij},$$

and $V$ is denoted as a frequency-domain vector. The Inverse Discrete Fourier Transform (IDFT) of $V$ is $\mathcal{F}^{-1}[V] = v$, and it holds

$$v_j = n^{-1} \sum_{i=0}^{n-1} V_i \alpha^{-ij}. \tag{1}$$

The vector $v$ is referred to as a time-domain vector.

Note that the DFT can be interpreted as the vector-matrix multiplication $\mathcal{F}[v] = v\mathcal{F}$ with an $n \times n$ Vandermonde matrix $\mathcal{F} = (\alpha^{ij})_{i,j=0}^{n-1}$. Using this interpretation, the IDFT becomes $\mathcal{F}^{-1}[V] = V\mathcal{F}^{-1}$.
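Definition 1 can be tried out numerically. The following is a minimal sketch, assuming the toy prime field $\mathbb{F}_{13}$ with primitive element $\alpha = 2$ (both chosen here for illustration, not taken from the paper); the function names `dft` and `idft` are hypothetical.

```python
# Assumed toy parameters: q = 13 (prime), n = q - 1 = 12, alpha = 2.
Q = 13
N = Q - 1
ALPHA = 2  # 2 generates the multiplicative group of F_13

def dft(v):
    """Frequency-domain vector: V_j = sum_i v_i * alpha^(i*j)."""
    return [sum(v[i] * pow(ALPHA, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

def idft(V):
    """Time-domain vector: v_j = n^(-1) * sum_i V_i * alpha^(-i*j)."""
    n_inv = pow(N, -1, Q)          # modular inverse, needs Python 3.8+
    alpha_inv = pow(ALPHA, -1, Q)
    return [n_inv * sum(V[i] * pow(alpha_inv, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

v = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8]
V = dft(v)
print(V)
print(idft(V) == v)  # the IDFT undoes the DFT
```

Because $q$ is prime here, field arithmetic reduces to integer arithmetic modulo $q$; an extension field would need a proper finite-field implementation.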

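Definition 2 For parameters $n, k \in \mathbb{N}$ with $k < n = q - 1$, a primitive Reed-Solomon (RS) code over $\mathbb{F}_q$ can be defined as

$$\mathcal{RS}(\mathbb{F}_q; n, k, d) = \Big\{ \mathcal{F}^{-1}[C] : C = (\underbrace{C_0, \ldots, C_{k-1}}_{k \text{ times}}, \underbrace{0, \ldots, 0}_{n-k \text{ times}}),\ C_j \in \mathbb{F}_q \Big\}.$$

Length and dimension of the code are given by $n$ and $k$, respectively. $\mathcal{RS}$ is linear, i.e., for $\beta_1, \beta_2 \in \mathbb{F}_q$ the implication $c_1, c_2 \in \mathcal{RS} \Rightarrow \beta_1 c_1 + \beta_2 c_2 \in \mathcal{RS}$ holds.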

It is a well-known fact that the minimum distance $d$ of RS codes meets the Singleton Bound with equality, i.e., $d = n - k + 1$. Since the minimum distance of a code is defined as the minimal number of positions in which any two codewords $c_1$ and $c_2$, $c_1 \neq c_2$, differ, it immediately follows that any codeword is uniquely determined by any $k$ of its positions. This is commonly denoted as the Maximum Distance Separable (MDS) property.
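The equality $d = n - k + 1$ can be confirmed by brute force for very small codes. The sketch below assumes the toy field $\mathbb{F}_7$ with primitive element $\alpha = 3$ (an illustrative choice) and enumerates all codewords as IDFTs of zero-padded frequency-domain vectors; the helper names are hypothetical.

```python
from itertools import product

Q, ALPHA = 7, 3      # assumed toy field F_7 with primitive element 3
N = Q - 1

def idft(V):
    """v_j = n^(-1) * sum_i V_i * alpha^(-i*j) over F_Q."""
    n_inv = pow(N, -1, Q)
    alpha_inv = pow(ALPHA, -1, Q)
    return [n_inv * sum(V[i] * pow(alpha_inv, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

def codewords(k):
    """All codewords: IDFTs of vectors whose last n - k entries are zero."""
    for info in product(range(Q), repeat=k):
        yield idft(list(info) + [0] * (N - k))

def minimum_distance(k):
    """By linearity, the minimum distance equals the minimum non-zero weight."""
    return min(sum(1 for x in c if x) for c in codewords(k) if any(c))

for k in (1, 2, 3):
    print(k, minimum_distance(k), N - k + 1)
```

The enumeration visits $q^k$ codewords, so it is only meant as a sanity check for tiny parameters.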

Transmission of a codeword $c \in \mathcal{RS}$ over a channel results in the reception of a received vector $r = c + e$, which is distorted by an error vector $e$. Inspired by the MDS property, we can define the following simple mapping. It maps any vector $r$ received from the transmission channel to a modified received vector with beneficial properties for the decoding.

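Definition 3 For an RS code $\mathcal{RS}(\mathbb{F}_q; n, k, d)$ let $J = \{j_0, \ldots, j_{\sigma-1}\}$ be a set of $\sigma$ positions, $0 \le j_t \le n - 1$. Then the re-encoding map with regard to $J$ is

$$\mathcal{R}_J : \begin{cases} \mathbb{F}_q^n \to \mathbb{F}_q^n \\ v \mapsto v + c, \end{cases}$$

where $c = (c_0, \ldots, c_{n-1}) \in \mathcal{RS}$ such that $v_j = -c_j$ holds for all $j \in J$.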

If $\sigma = k$ (the only practically relevant case), then $\mathcal{R}_J$ is a projection, i.e., it is idempotent. This can be seen from the fact that all positions $j \in J$ in $\mathcal{R}_J[v] = v + c_1$ are zero by definition. But then $\mathcal{R}_J[\mathcal{R}_J[v]] = v + c_1 + c_2$, where $c_2 \in \mathcal{RS}$ is zero at the $k$ positions in $J$, which is only possible if it is the all-zero codeword.



The effect of re-encoding when applied to a received vector $r$ is straightforward to see. It maps $r = c + e$ to

$$\mathcal{R}_J[r] = \underbrace{c + \bar{c}}_{= c' \in \mathcal{RS}} + e,$$

i.e., to another received vector with at most $n - \sigma$ non-zero positions. We will see in Section IV why this is useful. It should be clear that a codeword $\bar{c}$ can be efficiently calculated using an erasures-only decoder as long as $\sigma < n - k$. It should also be clear that the transmitted codeword $c$ can be recovered after successful decoding of the modified received vector $\mathcal{R}_J[r]$.
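The re-encoding map for $\sigma = k$ can be sketched with plain Lagrange interpolation instead of an erasures-only decoder (a swapped-in technique chosen for brevity). The sketch assumes the toy field $\mathbb{F}_{13}$, $\alpha = 2$, $k = 4$, and the IDFT convention $v_j = n^{-1} \sum_i V_i \alpha^{-ij}$; all function names are hypothetical.

```python
Q, ALPHA = 13, 2         # assumed toy field F_13 with primitive element 2
N, K = Q - 1, 4          # assumed code parameters n = 12, k = 4

def idft(V):
    n_inv = pow(N, -1, Q)
    alpha_inv = pow(ALPHA, -1, Q)
    return [n_inv * sum(V[i] * pow(alpha_inv, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

def times_linear(poly, root):
    """Multiply a polynomial (lowest degree first) by (x - root) over F_Q."""
    out = [0] * (len(poly) + 1)
    for i, coeff in enumerate(poly):
        out[i] = (out[i] - coeff * root) % Q
        out[i + 1] = (out[i + 1] + coeff) % Q
    return out

def interpolate(points):
    """Lagrange interpolation over F_Q; returns len(points) coefficients."""
    result = [0] * len(points)
    for m, (x_m, y_m) in enumerate(points):
        basis, denom = [1], 1
        for l, (x_l, _) in enumerate(points):
            if l != m:
                basis = times_linear(basis, x_l)
                denom = denom * (x_m - x_l) % Q
        scale = y_m * pow(denom, -1, Q) % Q
        result = [(r + scale * b) % Q for r, b in zip(result, basis)]
    return result

def reencode(v, J):
    """Add the codeword c with c_j = -v_j for all j in J (here |J| = K)."""
    # c_j = n^(-1) * C(alpha^(-j)), hence C(alpha^(-j)) must equal -n * v_j.
    points = [(pow(ALPHA, -j, Q), (-N * v[j]) % Q) for j in J]
    C = interpolate(points) + [0] * (N - len(J))
    return [(a + b) % Q for a, b in zip(v, idft(C))]

r = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8]
J = [0, 1, 2, 3]
modified = reencode(r, J)
print(modified)                          # zero at the positions in J
print(reencode(modified, J) == modified) # the map is idempotent
```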

Re-encoding is a well-known concept in algebraic coding. It has been used implicitly by Welch and Berlekamp in [4] in order to decrease the complexity of their interpolation-based bounded minimum distance decoder and has been revitalized in the context of interpolation-based list decoding [14]-[16].

III. The Periodicity Projection 

This section is devoted to a mapping technique for received vectors first proposed in [18]. It is based on a certain property of the DFT, which we recall as Theorem 1. We show at the end of the section that this technique is a special case of re-encoding as in Definition 3.

Definition 4 Let $n = q - 1$ for a prime power $q$ and let $p \in \mathbb{N} \setminus \{0\}$ such that $p \mid n$. A vector $V \in \mathbb{F}_q^n$ of the form

$$V = (\underbrace{T \mid \cdots \mid T}_{n/p \text{ times}}),$$

where $T$ is a template vector of the form

$$T = (T_0, \ldots, T_{p-1}),$$

is denoted as a $p$-periodic vector.

In this paper, all $p$-periodic vectors are frequency-domain vectors without particularly mentioning it. The following theorem relates $p$-periodicity in frequency domain to sparsity in time domain. It can be seen as a less prominent property of the DFT next to widely known properties such as the convolution property [19, Theorem 6.1.3] or the polynomial root property [19, Theorem 6.1.5].

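Theorem 1 A vector $V = (V_0, \ldots, V_{n-1})$ is $p$-periodic with template vector $T = (T_0, \ldots, T_{p-1})$ if and only if its time-domain counterpart is $v = (v_0, \ldots, v_{n-1}) = \mathcal{F}^{-1}[V] \in \mathbb{F}_q^n$, where

$$v_j = \begin{cases} p^{-1} \sum_{s=0}^{p-1} T_s \alpha^{-sj} & \text{if } n/p \mid j \\ 0 & \text{if } n/p \nmid j. \end{cases}$$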
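Proof: By Definition 1, $v_j = n^{-1} \sum_{i=0}^{n-1} V_i \alpha^{-ij}$, which can be written in terms of the template vector as

$$v_j = n^{-1} \sum_{t=0}^{n/p-1} \sum_{s=0}^{p-1} T_s (\alpha^{-j})^{tp+s}. \tag{2}$$

Let us write $j = \mu n/p + \nu$ with $\mu, \nu \in \mathbb{N}$ and $0 \le \nu < n/p$. Then the summand can be written as

$$T_s (\alpha^{-j})^{tp+s} = T_s \alpha^{-\mu n t p/p - \mu n s/p - \nu t p - \nu s} = \alpha^{-\nu t p}\, T_s\, \alpha^{-\mu t (q-1) - s(\mu n/p + \nu)} \overset{(*)}{=} \alpha^{-\nu t p}\, T_s\, \alpha^{-s(\mu n/p + \nu)}, \tag{3}$$

where $(*)$ follows from the fact that $\beta^{q-1} = 1$ holds for any $\beta \in \mathbb{F}_q \setminus \{0\}$ (Lagrange's Theorem). In case $n/p \mid j$, we have $\nu = 0$ and $j = \mu n/p$. Hence, using (3), (2) becomes

$$v_j = n^{-1} \sum_{t=0}^{n/p-1} \sum_{s=0}^{p-1} T_s \alpha^{-sj} = p^{-1} \sum_{s=0}^{p-1} T_s \alpha^{-sj},$$

which proves the first part of the statement. If $n/p \nmid j$, then $\nu \in \mathbb{N} \setminus \{0\}$ and, again using (3), (2) becomes

$$v_j = n^{-1} \sum_{t=0}^{n/p-1} \alpha^{-\nu t p} \sum_{s=0}^{p-1} T_s \alpha^{-s(\mu n/p + \nu)},$$

where we can exchange inner and outer summation in order to obtain

$$v_j = n^{-1} \sum_{s=0}^{p-1} T_s \alpha^{-s(\mu n/p + \nu)} \sum_{t=0}^{n/p-1} \alpha^{-\nu t p}.$$

But

$$\sum_{t=0}^{n/p-1} \alpha^{-\nu t p} = \frac{1 - \alpha^{-\nu n}}{1 - \alpha^{-\nu p}} = \frac{1 - \alpha^{-(q-1)\nu}}{1 - \alpha^{-\nu p}} = 0, \tag{4}$$

since the sum is a geometric series (its ratio $\alpha^{-\nu p}$ differs from 1 because $0 < \nu p < n$), proving that $v_j = 0$ whenever $n/p \nmid j$. ■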
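Theorem 1 can be checked numerically on a small instance. The sketch below assumes the toy field $\mathbb{F}_{13}$ with $\alpha = 2$, period $p = 4$, an arbitrary template vector, and the IDFT convention $v_j = n^{-1} \sum_i V_i \alpha^{-ij}$; it compares the IDFT of a $p$-periodic vector with the closed form $p^{-1} \sum_s T_s \alpha^{-sj}$ on the multiples of $n/p$.

```python
Q, ALPHA = 13, 2     # assumed toy field F_13 with primitive element 2
N = Q - 1

def idft(V):
    n_inv = pow(N, -1, Q)
    alpha_inv = pow(ALPHA, -1, Q)
    return [n_inv * sum(V[i] * pow(alpha_inv, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

p = 4                        # period, must divide n = 12
T = [5, 0, 7, 11]            # arbitrary template vector of length p
V = T * (N // p)             # p-periodic frequency-domain vector
v = idft(V)

# Closed form of Theorem 1: non-zero entries only at multiples of n/p.
p_inv = pow(p, -1, Q)
predicted = [
    p_inv * sum(T[s] * pow(ALPHA, -s * j, Q) for s in range(p)) % Q
    if j % (N // p) == 0 else 0
    for j in range(N)
]

print(v)
print(v == predicted)
```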
It is interesting to note that periodicity is maintained by cyclic convolution with arbitrary vectors. This simple corollary can be easily seen using the convolution property of the DFT.

Corollary 1 Let $V = (V_0, \ldots, V_{n-1}) \in \mathbb{F}_q^n$ be $p$-periodic and $W = (W_0, \ldots, W_{n-1}) \in \mathbb{F}_q^n$ be arbitrary. Then the cyclic convolution $U = (U_0, \ldots, U_{n-1})$ of these vectors, where

$$U_j = \sum_{i=0}^{n-1} V_i W_{(j-i) \bmod n} = \sum_{i=0}^{n-1} V_{(j-i) \bmod n} W_i,$$

is $p$-periodic.
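A quick numerical illustration of Corollary 1, assuming arithmetic modulo the prime $q = 13$ and the standard definition of cyclic convolution; the vectors and the helper names are arbitrary illustrative choices.

```python
Q = 13               # assumed toy prime field F_13
N = Q - 1

def cyclic_convolution(V, W):
    """U_j = sum_i V_i * W_((j - i) mod n) over F_Q."""
    return [sum(V[i] * W[(j - i) % N] for i in range(N)) % Q
            for j in range(N)]

def is_periodic(U, p):
    """True if U repeats its first p entries throughout."""
    return all(U[j] == U[j % p] for j in range(len(U)))

V = [5, 0, 7, 11] * 3                        # 4-periodic vector
W = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8]     # arbitrary, not periodic
U = cyclic_convolution(V, W)

print(U)
print(is_periodic(U, 4))
```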



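Definition 5 Let $p \in \mathbb{N} \setminus \{0\}$ such that $p \mid n$. Then the periodicity projection with regard to $p$ is defined as the map

$$\mathcal{P}_p : \begin{cases} \mathbb{F}_q^n \to \mathbb{F}_q^n \\ v \mapsto \mathcal{F}^{-1}[(\underbrace{T \mid \cdots \mid T}_{n/p \text{ times}})], \end{cases}$$

where $T = (V_{n-p}, \ldots, V_{n-1})$ and $\mathcal{F}[v] = (V_0, \ldots, V_{n-1})$.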

The map $\mathcal{P}_p$ is indeed a projection, i.e., it is idempotent. This follows from the fact that its values depend only on the rightmost $p$ positions of the frequency-domain counterpart of the input vector $v$ and these positions are not affected by $\mathcal{P}_p$. Note that the periodicity projection can be interpreted as a linear operator $\mathcal{P}_p = \mathcal{F}^{-1} P \mathcal{F}$ with a sparse matrix



$$P = \begin{pmatrix} 0 & \cdots & 0 \\ I & \cdots & I \end{pmatrix}.$$

Here, $0$ denotes the all-zero matrix and $I$ the identity matrix.

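Theorem 2 Let $\mathcal{RS}(\mathbb{F}_q; n, k, d)$ be an RS code. Let further $p \in \mathbb{N} \setminus \{0\}$ such that $p \mid n$ and $p \ge d - 1$. If $c \in \mathcal{RS}$ is a codeword, $e \in \mathbb{F}_q^n$ is an error vector of Hamming weight $\mathrm{wt}_H[e] = \varepsilon$, and $r = c + e$ is the received vector, then

$$\mathcal{P}_p[r] = c' + e,$$

where

$$c' = c + \mathcal{P}_p[r] - r$$

and $c' \in \mathcal{RS}$ with $\mathrm{wt}_H[c'] \le p + \varepsilon$.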

Proof: Let $R = (R_0, \ldots, R_{n-1}) = \mathcal{F}[r]$ be the frequency-domain counterpart of $r$. By Definition 5,

$$\mathcal{P}_p[r] = \mathcal{F}^{-1}[(\underbrace{T \mid \cdots \mid T}_{n/p \text{ times}})], \tag{5}$$

where the template vector $T = (R_{n-p}, \ldots, R_{n-1})$ consists of the $p$ rightmost components of $R$. If we group the components of $R$ into $n/p$ blocks of length $p$, i.e., $R = (\mathbf{R}_0 \mid \cdots \mid \mathbf{R}_{n/p-1})$, then $T = \mathbf{R}_{n/p-1}$ and consequently

$$\mathcal{P}_p[r] - r = \mathcal{F}^{-1}[(T - \mathbf{R}_0 \mid \cdots \mid T - \mathbf{R}_{n/p-2} \mid Z)],$$

where $Z$ is the all-zero vector of length $p$. Since we assumed $p \ge d - 1$, it follows from Definition 2 that $\bar{c} = \mathcal{P}_p[r] - r \in \mathcal{RS}$. The first part of the claim follows from substituting $r = c + e$ and the linearity of $\mathcal{RS}$. As for the second claim, it follows from Theorem 1 that $\mathrm{wt}_H[\mathcal{P}_p[r]] \le p$. But $\mathcal{P}_p[r] = c' + e$ with $\mathrm{wt}_H[e] = \varepsilon$. Thus, $c'$ differs from $\mathcal{P}_p[r]$ in at most $\varepsilon$ positions and the bound $\mathrm{wt}_H[c'] \le p + \varepsilon$ follows. ■
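The statement of Theorem 2 can be exercised on a small example. The sketch below assumes the toy field $\mathbb{F}_{13}$ with $\alpha = 2$, an RS code with $n = 12$, $k = 8$ (so $d - 1 = 4$) and $p = 4$; the codeword, the error vector and all helper names are arbitrary illustrative choices.

```python
Q, ALPHA = 13, 2          # assumed toy field F_13 with primitive element 2
N, K = Q - 1, 8           # assumed RS code with n = 12, k = 8, d - 1 = 4

def dft(v):
    return [sum(v[i] * pow(ALPHA, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

def idft(V):
    n_inv = pow(N, -1, Q)
    alpha_inv = pow(ALPHA, -1, Q)
    return [n_inv * sum(V[i] * pow(alpha_inv, i * j, Q) for i in range(N)) % Q
            for j in range(N)]

def periodicity_projection(v, p):
    """Repeat the p rightmost frequency-domain components n/p times."""
    template = dft(v)[N - p:]
    return idft(template * (N // p))

def is_codeword(c):
    """Codewords have n - k trailing zeros in the frequency domain."""
    return all(x == 0 for x in dft(c)[K:])

def weight(v):
    return sum(1 for x in v if x)

p = 4
c = idft([1, 2, 3, 4, 5, 6, 7, 8] + [0] * (N - K))   # some codeword
e = [0] * N
e[1], e[7] = 5, 9                                    # error of weight 2
r = [(a + b) % Q for a, b in zip(c, e)]

projected = periodicity_projection(r, p)
c_prime = [(a - b) % Q for a, b in zip(projected, e)]

print(weight(projected), weight(c_prime), is_codeword(c_prime))
```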
We emphasize that for $r = c + e$ and $\mathcal{P}_p[r] = c' + e$ as in the theorem, $\mathcal{P}_p[c] \neq c'$ and $\mathcal{P}_p[e] \neq e$ hold in general. From Theorem 1 and the proof of Theorem 2 we can immediately extract the following simple corollary.

Corollary 2 The periodicity projection is a special case of the re-encoding map with $\sigma = n - p$ and $J = \{j : n/p \nmid j\}$.

IV. The Guruswami-Sudan Algorithm

The GSA [1] can be divided into two steps, the interpolation step (Problem 1) and the factorization step (Problem 2). Our focus here is on the interpolation step, which is computationally more involved. In the following, let $c \in \mathcal{RS}(\mathbb{F}_q; n, k, d)$ be a codeword, $e \in \mathbb{F}_q^n$ be an error vector of Hamming weight $\mathrm{wt}_H[e] = \varepsilon$, and $r = c + e$ be a received vector from the transmission channel. Furthermore, let $s, \ell \in \mathbb{N} \setminus \{0\}$ be two parameters of the GSA with $s \le \ell$.



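Problem 1 (Interpolation Step) Find a non-zero bivariate polynomial $Q(x, y) = Q_0(x) + Q_1(x) y + \cdots + Q_\ell(x) y^\ell$ over $\mathbb{F}_q$ such that

$$\deg[Q_\nu(x)] \le s(n - \varepsilon_0) - 1 - \nu(k - 1) = d_\nu \tag{6}$$

and

$$\forall j = 0, \ldots, n-1 \text{ and } a, b \in \mathbb{N},\ a + b < s: \quad \sum_{\nu=b}^{\ell} \sum_{\mu=a}^{d_\nu} \binom{\nu}{b} \binom{\mu}{a} Q_{\mu,\nu}\, x^{\mu-a} y^{\nu-b} \Big|_{(x,y) = (\alpha^{-j}, r_j)} = 0, \tag{7}$$

where $Q_\nu(x) = \sum_{\mu=0}^{d_\nu} Q_{\mu,\nu} x^\mu$.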



The nested sum in (7) is called the $(a, b)$-th mixed partial Hasse derivative [20] of $Q(x, y)$. The condition that all $(a, b)$-th Hasse derivatives with $a + b < s$ evaluate to zero for all tuples $(\alpha^{-j}, r_j)$, $j = 0, \ldots, n-1$, means that these tuples are zeros of multiplicity $s$ of $Q(x, y)$. For that reason, we refer to the parameter $s$ as the multiplicity of the GSA. It can be shown that the homogeneous linear system associated with Problem 1 has a non-zero solution (i.e., it has more unknowns than equations) as long as

$$\varepsilon \le \varepsilon_0 < \frac{n(2\ell - s + 1)}{2(\ell + 1)} - \frac{\ell(k - 1)}{2s}.$$



A quick analysis shows that the system has $ns(s+1)/2$ equations and $\sum_{\nu=0}^{\ell} (d_\nu + 1)$ unknowns. Both numbers are exceedingly large even for short RS codes and intermediate parameters $s$ and $\ell$. As a result, the time complexity of solving the system with Gaussian elimination is in $O[s^3 \ell^3 n^3]$.
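To get a feeling for these numbers, the following sketch counts equations and unknowns for given parameters. It assumes that the degree constraint reads $d_\nu = s(n - \varepsilon_0) - 1 - \nu(k - 1)$ and that the decoding radius must stay strictly below $n(2\ell - s + 1)/(2(\ell + 1)) - \ell(k - 1)/(2s)$; both are readings of the formulas above, and negative $d_\nu$ are treated as contributing no unknowns. The function names and example parameters are hypothetical.

```python
from fractions import Fraction

def radius_bound(n, k, s, ell):
    """Assumed bound on the decoding radius (to be met strictly)."""
    return (Fraction(n * (2 * ell - s + 1), 2 * (ell + 1))
            - Fraction(ell * (k - 1), 2 * s))

def system_size(n, k, s, ell, eps0):
    """Return (number of equations, number of unknowns)."""
    equations = n * s * (s + 1) // 2
    degrees = [s * (n - eps0) - 1 - nu * (k - 1) for nu in range(ell + 1)]
    # Assumption: a negative degree bound means Q_nu(x) is identically zero.
    unknowns = sum(max(d + 1, 0) for d in degrees)
    return equations, unknowns

n, k, s, ell = 12, 4, 2, 4
print(radius_bound(n, k, s, ell))       # 27/5, so eps0 = 5 is admissible
print(system_size(n, k, s, ell, 5))     # more unknowns than equations
print(system_size(n, k, s, ell, 6))     # fewer unknowns than equations
```

For comparison, half the minimum distance of this toy code allows only $\lfloor (d-1)/2 \rfloor = 4$ errors.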

After a solution of Problem 1 is found, the following problem must be solved.

Problem 2 (Factorization Step) Given a solution $Q(x, y)$ of Problem 1, find all factors $y - F(x)$ with $\deg[F(x)] < k$.

If we associate the at most $\ell$ resulting polynomials $F_\kappa(x)$ with padded vectors

$$F_\kappa = (F_{\kappa,0}, \ldots, F_{\kappa,k-1}, \underbrace{0, \ldots, 0}_{n-k \text{ times}}),$$

then Definition 2 tells that $f_\kappa = \mathcal{F}^{-1}[F_\kappa] \in \mathcal{RS}$. Since the parameters were chosen such that the GSA can correct at most $\varepsilon_0$ errors, the result list $\mathcal{L}$ of the GSA contains all $f_\kappa$ with $d_H[f_\kappa, r] \le \varepsilon_0$. It is proven in [1] that under all these assumptions $c \in \mathcal{L}$. Since $|\mathcal{L}| \le \ell$, we refer to $\ell$ as the list size of the GSA.

Problem 2 can be solved with time complexity in $O[\ell \log\log[\ell]\, n^2]$ using a technique from [9], but this is not within the scope of this manuscript.

It follows from the exposition of the Welch-Berlekamp algorithm in [3] and the interpretation of Justesen and Høholdt in [5, Sections 5.2 and 12.2] that the GSA simplifies to the Sudan algorithm if we restrict the multiplicity to $s = 1$ and that it further simplifies to the Welch-Berlekamp algorithm if we additionally restrict the list size to $\ell = 1$.



V. Compressing the Interpolation Step 

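We will now investigate the GSA when it is applied to a modified received vector $\mathcal{R}_J[r]$ regarding $J$. Doing so, we examine the Hasse derivative in (7) for $j \in J$, i.e., at the positions where the modified received vector is zero. Slightly misusing mathematical notation (by setting $0^0 = 1$), it becomes

$$\forall j \in J \text{ and } a, b \in \mathbb{N},\ a + b < s: \quad \sum_{\nu=b}^{\ell} \sum_{\mu=a}^{d_\nu} \binom{\nu}{b} \binom{\mu}{a} Q_{\mu,\nu}\, x^{\mu-a} y^{\nu-b} \Big|_{(x,y) = (\alpha^{-j}, 0)} = 0,$$

meaning that the inner sum can contribute a non-zero value only if $\nu = b$. But this allows us to write

$$\forall j \in J \text{ and } a, b \in \mathbb{N},\ a < s - b: \quad \sum_{\mu=a}^{d_b} \binom{\mu}{a} Q_{\mu,b}\, x^{\mu-a} \Big|_{x = \alpha^{-j}} = 0,$$

which means that $\alpha^{-j}$ is a root of multiplicity $s - b$ of all $Q_b(x)$, $0 \le b < s$. Let us define the polynomial

$$V(x) \triangleq \prod_{j \in J} (x - \alpha^{-j}). \tag{8}$$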

With that we have proven the first part of our first main 
theorem: 



Theorem 3 Let $Q(x, y) = Q_0(x) + Q_1(x) y + \cdots + Q_\ell(x) y^\ell$ be a solution of Problem 1 for a modified received vector $\mathcal{R}_J[r]$ with regard to $J$. Then the $Q_b(x)$, $0 \le b < s$, can be written as

$$Q_b(x) = W_b(x) V_b(x),$$

where $V_b(x) = V(x)^{s-b}$ and $V(x)$ is known and given by (8), and

$$\deg[W_b(x)] \le d_b - \sigma(s - b).$$

Proof: The second part follows from the simple observation that $\deg[V(x)^{s-b}] = \sigma(s - b)$ and (6). ■

We will now interpret this result in the setting of the homogeneous linear system associated with (7). The coefficients of $Q_b(x) = \sum_{\mu=0}^{d_b} Q_{\mu,b} x^\mu$, $b < s$, are a subset of the system's solution variables and we know from the theorem that they can be written as a linear combination with known factors (the coefficients of $V_b(x) = \sum_{\mu=0}^{\sigma(s-b)} V_{\mu,b} x^\mu$) and unknown terms (the coefficients of $W_b(x) = \sum_{\mu=0}^{d_b - \sigma(s-b)} W_{\mu,b} x^\mu$).

If it is known in advance that some of the solution variables of a system are linearly dependent, a simple and well-known trick can be applied. Due to space restrictions, we state this trick only by means of a small example. Assume that we have a linear system over an arbitrary field with coefficient matrix

$$\begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix}$$

and know that a solution $(x, y, z)^T$ exists and $z = \alpha x + \beta y$. Then, $(x, y, z)^T$ can as well be recovered by first solving the system with the compressed coefficient matrix

$$\begin{pmatrix} a + c\alpha & b + c\beta \\ d + f\alpha & e + f\beta \\ g + i\alpha & h + i\beta \end{pmatrix}$$

for $(x, y)^T$ and then setting $z = \alpha x + \beta y$. Obviously, at least one column of the compressed coefficient matrix is linearly dependent on the others.
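The column-merging step of the trick can be illustrated with concrete numbers. The sketch below uses a small homogeneous integer system chosen for illustration (matrix entries, $\alpha = 3$ and $\beta = -1$ are arbitrary assumptions) and shows that the compressed matrix annihilates $(x, y)^T$ exactly when the original matrix annihilates $(x, y, \alpha x + \beta y)^T$.

```python
def compress(A, alpha, beta):
    """Fold the third column into the first two, using z = alpha*x + beta*y."""
    return [[row[0] + row[2] * alpha, row[1] + row[2] * beta] for row in A]

def matvec(M, vec):
    """Plain matrix-vector product."""
    return [sum(m * x for m, x in zip(row, vec)) for row in M]

# Illustrative homogeneous system with the non-zero solution (1, 2, 1).
A = [[1, 1, -3],
     [2, -1, 0],
     [0, 1, -2]]
alpha, beta = 3, -1

x, y = 1, 2
z = alpha * x + beta * y        # recover the dependent variable

A_compressed = compress(A, alpha, beta)
print(A_compressed)
print(matvec(A, [x, y, z]), matvec(A_compressed, [x, y]))
```

Only the two free variables remain in the compressed system; the dependent one is reconstructed afterwards from the known relation.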

Repeated and nested application of the trick basically allows solving a linear system for the coefficients of the polynomials $W_0(x), \ldots, W_{s-1}(x), Q_s(x), \ldots, Q_\ell(x)$ instead of $Q_0(x), \ldots, Q_\ell(x)$, meaning that the number of unknowns, and thereby the system size, is diminished by $\sum_{b=0}^{s-1} \sigma(s - b)$. This compression effect is maximized for larger $\sigma$, hence in a practical system one should always choose $\sigma = k$. In that case, the number of linearly independent equations in the system is diminished from $ns(s+1)/2$ to $(n-k)s(s+1)/2$. However, it is not a priori clear for general re-encoding which of the equations can be discarded due to linear dependence.
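The size of the reduction can be written in closed form. The following short derivation is a worked step added for illustration; it only evaluates the sum stated above and introduces no new assumptions beyond the symbol $\sigma$ for the number of re-encoding positions.

```latex
\sum_{b=0}^{s-1} \sigma (s - b)
  = \sigma \sum_{m=1}^{s} m
  = \sigma \, \frac{s(s+1)}{2},
\qquad\text{e.g.}\quad
s = 2:\ 3\sigma, \qquad s = 3:\ 6\sigma .
```

Subtracting the same amount from $ns(s+1)/2$ gives $(n - \sigma)s(s+1)/2$, which agrees with the count of $p\,s(s+1)/2$ equations stated for the periodicity projection, where $\sigma = n - p$.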

Now consider the GSA when applied to $\mathcal{P}_p[r]$. Recall that this is equivalent to general re-encoding with $\sigma = n - p$ and $J = \{j : n/p \nmid j\}$. In that case, it follows from Lagrange's Theorem that for each root $\alpha^{-j}$ of $V(x)$, its inverse $\alpha^{j}$ is as well a root of $V(x)$. Polynomials with this property are palindromic or self-reciprocal, i.e., the sequence of their coefficients is a palindrome. This allows us to prove our second main theorem.

Theorem 4 Let $Q(x, y) = Q_0(x) + Q_1(x) y + \cdots + Q_\ell(x) y^\ell$ be a solution of Problem 1 for a modified received vector $\mathcal{P}_p[r]$ with regard to $p$. Then $V_b(x)$, $0 \le b < s$, is palindromic with constant term 1 and it is sparse, with non-zero coefficients displaced by at least $p$.

This structure of the $V_b(x)$, $0 \le b < s$, renders nested and repeated application of the trick as well as reconstruction of the complete polynomial $Q(x, y)$ from the solution of the compressed system nearly trivial, because the coefficients $Q_{\mu,b}$ are either sparse linear combinations of the $W_{\mu,b}$, simple multiples of a $W_{\mu,b}$, or even constant zero. Besides that, the palindromic structure yields that in the compressed coefficient matrix of the linear system, all rows $i$ with $n/p \nmid (i \bmod n)$ are zero, leaving only $p\,s(s+1)/2$ of the original $ns(s+1)/2$ equations.
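The structure claimed in Theorem 4 can be observed directly on a toy instance. The sketch below assumes $\mathbb{F}_{13}$ with $\alpha = 2$ and $p = 4$, builds $V(x) = \prod_{n/p \nmid j} (x - \alpha^{-j})$ by repeated multiplication, and squares it to obtain $V_0(x)$ for multiplicity $s = 2$; the helper names are hypothetical.

```python
Q, ALPHA = 13, 2     # assumed toy field F_13 with primitive element 2
N = Q - 1

def times_linear(poly, root):
    """Multiply a polynomial (lowest degree first) by (x - root) over F_Q."""
    out = [0] * (len(poly) + 1)
    for i, coeff in enumerate(poly):
        out[i] = (out[i] - coeff * root) % Q
        out[i + 1] = (out[i + 1] + coeff) % Q
    return out

def poly_mul(a, b):
    """Schoolbook polynomial multiplication over F_Q."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] = (out[i + j] + x * y) % Q
    return out

def locator(p):
    """V(x) = product of (x - alpha^(-j)) over all j not divisible by n/p."""
    V = [1]
    for j in range(N):
        if j % (N // p) != 0:
            V = times_linear(V, pow(ALPHA, -j, Q))
    return V

V = locator(4)
V_squared = poly_mul(V, V)   # V_0(x) = V(x)^s for s = 2

print(V)           # coefficients of 1 + x^4 + x^8
print(V_squared)   # palindromic, non-zero only at multiples of p
```

In this instance $V(x) = (x^{n} - 1)/(x^{p} - 1)$, which explains both the sparsity and the palindromic coefficient sequence.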

VI. Conclusion 

After a short recapitulation of the well-known re-encoding 
map, we have introduced the periodicity projection as a 
special case and have given its most important properties. 
An analysis of the interpolation step of the GSA has shown 
that applying either of the maps to the received vector results 
in a significant compression of the involved homogeneous 
system of linear equations. We have shown that compression 
and decompression are particularly simple in case of the 
periodicity projection, since the involved polynomials have 
large palindromic factors that can be calculated in advance. 



Besides further elaboration of the computational savings, it 
appears interesting to investigate the factorization step of the 
GSA when provided with the highly structured bivariate result 
polynomial in case of the periodicity projection. 